Reconciliation Revisited: Handling Multiple Optima When Reconciling with Duplication, Transfer, and Loss
نویسندگان
چکیده
Phylogenetic tree reconciliation is a powerful approach for inferring evolutionary events like gene duplication, horizontal gene transfer, and gene loss, which are fundamental to our understanding of molecular evolution. While duplication-loss (DL) reconciliation leads to a unique maximum-parsimony solution, duplication-transfer-loss (DTL) reconciliation yields a multitude of optimal solutions, making it difficult to infer the true evolutionary history of the gene family. This problem is further exacerbated by the fact that different event cost assignments yield different sets of optimal reconciliations. Here, we present an effective, efficient, and scalable method for dealing with these fundamental problems in DTL reconciliation. Our approach works by sampling the space of optimal reconciliations uniformly at random and aggregating the results. We show that even gene trees with only a few dozen genes often have millions of optimal reconciliations and present an algorithm to efficiently sample the space of optimal reconciliations uniformly at random in O(mn(2)) time per sample, where m and n denote the number of genes and species, respectively. We use these samples to understand how different optimal reconciliations vary in their node mappings and event assignments and to investigate the impact of varying event costs. We apply our method to a biological dataset of approximately 4700 gene trees from 100 taxa and observe that 93% of event assignments and 73% of mappings remain consistent across different multiple optima. Our analysis represents the first systematic investigation of the space of optimal DTL reconciliations and has many important implications for the study of gene family evolution.
منابع مشابه
Efficient algorithms for the reconciliation problem with gene duplication, horizontal transfer and loss
MOTIVATION Gene family evolution is driven by evolutionary events such as speciation, gene duplication, horizontal gene transfer and gene loss, and inferring these events in the evolutionary history of a given gene family is a fundamental problem in comparative and evolutionary genomics with numerous important applications. Solving this problem requires the use of a reconciliation framework, wh...
متن کاملEfficient Algorithms for the Reconciliation Problem with Gene Duplication, Horizontal Transfer and Loss Citation
Motivation: Gene family evolution is driven by evolutionary events such as speciation, gene duplication, horizontal gene transfer and gene loss, and inferring these events in the evolutionary history of a given gene family is a fundamental problem in comparative and evolutionary genomics with numerous important applications. Solving this problem requires the use of a reconciliation framework, w...
متن کاملA Reconciliation with Non-binary Gene Trees Revisited
By reconciling the phylogenetic tree of a gene family with the corresponding species tree, it is possible to infer lineage-specific duplications and losses with high confidence and hence to annotate orthologs and paralogs. The currently available reconciliation methods for non-binary gene trees are computationally expensive for genome-scale applications. We present four O(|G| + |S|) algorithms ...
متن کاملOn the Impact of Uncertain Gene Tree Rooting on Duplication-Transfer-Loss Reconciliation
Duplication-Transfer-Loss (DTL) reconciliation is a powerful and increasingly popular technique for studying the evolution of microbial gene families. DTL reconciliation requires the use of rooted gene trees to perform the reconciliation with the species tree, and the standard technique for rooting gene trees is to assign a root that results in minimum reconciliation cost across all rootings of...
متن کاملReconciliation of Gene and Species Trees With Polytomies
Motivation: Millions of genes in the modern species belong to only thousands of gene families. Genes duplicate and are lost during evolution. A gene family includes instances of the same gene in different species and duplicate genes in the same species. Two genes in different species are ortholog if their common ancestor lies in the most recent common ancestor of the species. Because of complex...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Journal of computational biology : a journal of computational molecular cell biology
دوره 20 10 شماره
صفحات -
تاریخ انتشار 2013